1 ATGCCCTTTT GCCACAATAT AATTAATATT TCCTGTGTGA AAAACAACTG 

51 GTCAAATGAT GTCCGTGCTT CCCTGTACAG TTTAATGGTG CTCATAATTC 

101 TGACCACACT CGTTGGCAAT CTGATAGTTA TTGTTTCTAT ATCACACTTC 

151 AAACAACTTC ATACCCCAAC AAATTGGCTC ATTCATTCCA TGGCCACTGT 

201 GGACTTTCTT CTGGGGTGTC TGGTCATGCC TTACAGTATG GTGAGATCTG 

251 CTGAGCACTG TTGGTATTTT GGAGAAGTCT TCTGTAAAAT TCACACAAGC 

301 ACCGACATTA TGCTGAGCTC AGCCTCCATT TTCCATTTGT CTTTCATCTC 

351 CATTGACCGC TACTATGCTG TGTGTGATCC ACTGAGATAT AAAGCCAAGA 

4 01 TGAATATCTT GGTTATTTGT GTGATGATCT TCATTAGTTG GAGTGTCCCT 

451 GCTGTTTTTG CATTTGGAAT GATCTTTCTG GAGCTAAACT TCAAAGGCGC 

501 TGAAGAGATA TATTACAAAC ATGTTCACTG CAGAGGAGGT TGCTCTGTCT 

551 TCTTTAGCAA AATATCTGGG GTACTGACCT TTATGACTTC TTTTTATATA 

601 CCTGGATCTA TTATGTTATG TGTCTATTAC AGAATATATC TTATCGCTAA 

651 AGAACAGGCA AGATTAATTA GTGATGCCAA TCAGAAGCTC CAAATTGGAT 

701 TGGAAATGAA AAATGGAATT TCACAAAGCA AAGAAAGGAA AGCTGTGAAG 

7 51 ACATTGGGGA TTGTGATGGG AGTTTTCCTA ATATGCTGGT GCCCTTTCTT 

801 TATCTGTACA GTCATGGACC CTTTTCTTCA CTACATTATT CCACCTACTT 

851 TGAATGATGT ATTGATTTGG TTTGGCTACT TGAACTCTAC ATTTAATCCA 

901 ATGGTTTATG CATTTTTCTA TCCTTGGTTT AGAAAAGCAC TGAAGATGAT 

951 GCTGTTTGGT AAAATTTTCC AAAAAGATTC ATCCAGGTGT AAATTATTTT 

1001 TGGAATTGAG TTCATAG 



FEATURES: 

Start: 1 
Stop: 1015 



I'd 



HOMOLOGOUS PROTEIN: 

Top 10 BLAST Hits: 

gi I 7657142 1 ref | NP_0554 4 1 . 1 1 G protein-coupled receptor 58 >gi|6. 
gi I 7657140 I ref | NP_0554 42 . 1 1 G protein-coupled receptor 57 >gii6. 
gi I 4505925 | ref |NP_003958 . 1 1 putative neurotransmitter receptor . 
gi|3941547 | gb I AAC82381 . 1 1 (AF069546) putative odorant receptor . 
gi|4028153|gb|AAC96117.1j (AF083221) putative neurotransmitter . 
gi | 6224 984 | sp I 070528 I 5H4_CAVPO 5-HYDROXYTRYPTAMINE 4 RECEPTOR (. 
gi | 3646424 | emb | CAA09599 . 1 | (AJ011370) serotonin 4 receptor [Rat. 
gi | 3326989 | emb | CAA73108 . 1 1 (Y12506) 5-HT4 receptor [Homo sapiens 
gi|3326991)emb|CAA73109.1| (Y12507) 5-HT4 receptor [Homo sapiens 
gi | 3646278 ) emb | CAA0 9600 . 1| (AJ011371) serotonin 4 receptor [Horn. 



317 
316 
264 
226 
210 
209 
208 
206 
206 
206 



8e-86 
2e-85 
9e-70 
2e-58 
le-53 
4e-53 
7e-53 
3e-52 
3e-52 
3e-52 



blast to dbEST: 

gi|7906115|gb|AW813121.11AW813121 CM4-ST0 189-051 099-02 l-h04 ST0... 436 e-120 



Tissue expression for modulatory use: 
EST Expression: 

gi |7 906115|gb|AW813121. 1 IAW813121 : Stomach 



cDNA panel screen: 

Human Brain 
Human fetal Brain 
Human placenta 
Human liver 
Human kidney 



FIGURE 1 



1 MPFCHNIINI SCVKNNWSND VRASLYSLMV LIILTTLVGN LIVIVSISHF 

51 KQLHTPTNWL IHSMATVDFL LGCLVMPYSM VRSAEHCWYF GEVFCKIHTS 

101 TDIMLSSASI FHLSFISIDR YYAVCDPLRY KAKMNILVIC VMIFISWSVP 

151 AVFAFGMIFL ELN FKGAEEI YYKHVHCRGG CSVFFSKISG VLTFMTSFYI 

201 PGSIMLCVYY RIYLIAKEQA RLISDANQKL QIGLEMKNGI SQSKERKAVK 

251 TLGIVMGVFL ICWCPFFICT VMDPFLHYII PPTLNDVLIW FGYLNSTFNP 

301 MVYAFFYPWF RKALKMMLFG KIFQKDSSRC KLFLELSS 

FEATURES: 

Functional domains and key regions : 
[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 

N-glycosylation site 
Number of matches: 3 

1 9-12 NISC 

2 ' 16-19 NWSN 

3 295-298 NSTF 

mjDocoooos PS00005 PKC PHOSPHO SITE 

Protein kinase C phosphorylation site 

327-329 SSR 

m PDOC00006 PS00006 Cia^HQSPHO SITE 
Casein kinase II phosphorylation site 

Number of matches : 3 

1 99-102 TSTD 

2 270-273 TVMD 

3 283-286 TLND 

141 PDOC00007 PS00007 TYR PHOSPHO SITE 
Tyrosine kinase phosphorylation site 

Number of matches : 2 

1 82-89 RSAEHCWY 

2 165-172 KGAEEIYY 

[51 PDOC00008 PS0Q008 MYRISTYL 
N-myristoylation site 

Number of matches: 2 

1 239-244 GISQSK 

2 253-258 GIVMGV 

[61 PDOC00210 PS00237 G PROTEIN RECEPTOR 
G-protein coupled receptors signature 

108-124 AS I FHLS FI S I DRYYAV 



Membrane spanning structure and domains: 
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Begin 


End 


Score 


Certainty 
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27 


47 


1. 


,997 
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1. 
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Certain 
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99 


119 


1. 


.200 


Certain 
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136 


156 


2. 


.270 


Certain 
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189 


209 


1. 


.960 


Certain 
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252 


272 


2. 


.448 


Certain 
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287 


307 


1, 


.057 


Certain 
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BLAST Alignment to Top Hit: 

>gi I 7657142 | ref | NP_0554 4 1 . 1 | G protein-coupled receptor 58 

>gi|6739494|gb|AAF27278.1|AF112460_l (AF112460) G 
protein-coupled receptor 58 [Homo sapiens] 
Length = 306 

Score = 317 bits (805), Expect = 8e-86 

Identities - 155/301 (51%), Positives = 202/301 (66%), Gaps = 13/301 (4%) 

Query: 26 LYSLMVLIILTTLVGNLIVIVSISHFKQLHTPTNWLIHSMATVDFLLGCLVMPYSMVRSA 85 

+YS M I T+ GNL +I+SIS+FKQLHTPTN+LI SMA DFLLG +MPYSM+RS 
Sbjct: 1 MYSFMAGSIFITIFGNLAMIISISYFKQLHTPTN FLILSMAITDFLLGFTIMPYSMIRSV 60 

Query: 86 EHCWYFGEVFCKIHTSTDIMLSSASIFHLSFISIDRYYAVCDPLRYKAKMNILVICVMIF 145 

E+CWYFG FCKI+ S D+MLS SIFHL ++IDR+YA+C PL Y K+ I VI ++ 
Sbjct: 61 ENCWYFGLT FCKIYYSFDLMLSITSIFHLCSVAIDRFYAICYPLLYSTKITIPVIKRLLL 120 

Query: 146 ISWSVPAVFAFGMIFLELN FKGAEEIYYKHVHCRGGCSVFFSKISGVLTFMTSFYIPGSI 205 

+ WSVP FAFG +FE GEY VC CV F-f K+ G FM F+ PGS+ 
Sbjct: 121 LCWSVPGAFAFGAVFSEAYADGIEG-YDILVACSSSCPVMFNKLWGTTLFMAGFFTPGSM 179 

Query: 206 MLCVYYRIYLIAKEQARLISD--ANQKLQIGLEMKNGISQSKERKAVKTLGIVMGVFLIC 263 

M+ +Y +1+ ++++ A I++ NQ Q+ K++KA KTLGIV+GVFL+C 

Sbjct: 180 MVGI YGKI FAVSRKHAHAINNLRENQNNQV KKDKKAAKTLGIVIGVFLLC 229 

Query: 264 WCPFFICTVMDPFLHYIIPPTLN DVLIWFGYLNSTFNPMVYAFFYPWFRKALKMMLFGKI 323 

W P F ++DPFL++ P L D L WFGY NST NP++Y FFYPWFR+ALK +L GKI 
Sbjct: 230 WFPCFFTILLDPFLNFSTPWLFDALTWFGYFNSTCNPLIYGFFYPWFRRALKYILLGKI 289 

Query: 324 F 324 
F 

Sbjct: 290 F 290 



Hmmer search results (Pfam) : 

Scores for sequence family classification (score includes all domains) : 



Model 


Description 


Score 


E- 


-value 
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CE00516 


CE00516 SEROTONIN_RECEPTOR 


208. 


8 


8, 


.5e-59 


3 


PF00001 


7 transmembrane receptor (rhodop 


203. 


1 


2 


.9e-63 


1 


CE00507 


CE00507 DOPAMINE_RECE PTOR 


154. 


5 


1 


.2e-42 


3 


CE00267 


E00267 Octopamine receptor 


133. 


5 


6 


. 6e-37 


3 


CE00503 


CE00503 ADRENERGIC_RECEPTOR 


128. 


1 




8e-35 


3 


CE00510 


CE00510 HISTAMINE_RECEPTOR 


124. 


6 


1 


.7e-35 


3 


CE00323 


E00323 kappa opioid receptor 


49. 


3 


2 


,2e-14 


2 


CE00336 


E00336 orexin receptor 


47. 


3 


7, 


.8e-13 


2 


CE00326 


E00326 mu opioid receptor 


43. 


2 


5 


. le-12 


2 


CE00177 


CE00177 ADENOSINE 


39. 


4 


3 


,8e-10 


2 


CE00205 


CE00205 GH_secretagogue 


37. 


0 


1 


.6e-12 


1 


CE00328 


E00328 tachykinin receptor 


33. 


1 


1 


.9e-08 


2 


CE00513 


CE00513 NEUROPEPTIDE_Y_RECEPTOR 


24. 


9 


1, 


.8e-06 


1 


CE00530 


CE00530 CHEMOKINE_RECEPTOR_TYPE_ 


24 . 


0 


5, 


.2e-07 


2 


CE00266 


CE00266 Conopressin receptor 


19. 


5 


0, 


.00016 


1 
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1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 

n 1201 

1251 

1301 

P 1351 
[J 1401 
!<] 1451 

1501 

V Z 1551 
1601 

in i65i 

1701 

v=> 1751 
!;!? 1801 
58 1851 
O 1901 
I 951 

^ 2001 

!;~ 2051 
^ 2101 

2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 



TGTGTGCCTT GCCCACGGCC CACATGGAAA CTCAGCTTCC CTGGGCTCTC 
ACCTCAGTGC TCTTTGCACC ATGCCACTTC ACTTCTTACC ATAGACTTTA 
GTCTGTATAT CAAAAATCCT TAATAAGTGG GACCACAATA TCATTCTTCA 
CTAGTAGGTG CCATTATTTA AGACATCAAA AATTCTTATA TGAAAATTTA 
AACCTTAATT TTTAAAAATT GACCTCAAGT TGGTTCTTTC TGCAATAGAG 
AGAAAGACTG GACTGATATC AAAGAAACTG AACTAGTAAT TGAGACCACA 
GCTCATATTC AAGCTTTAAT GATTTGTGAA TTAAACTTCA ATTTCTTGCC 
AGTAAAATGA GGTTAATGAA GCATGTCATG TCTTCCTTAC AAAAAATGAT 
GTTGGTCTTA TAAATGTCAA ATAAAACTGT AACTGAAATA ATGCATGGAA 
TATATGTAGA AGTACTTTGC AAGCTGCAAA TAAATATATG TAAGCTTCTT 
CTTATTACAA TTAATTATAA AATAAGAAAT ATATAATTTA CGTATGACCA 
AGTGCAGAAG TGGGCATAGT GCGTATTAAT TCTGCACAAG AGGCACATTT 
AAAAGACTTC TTGTCTTTTT CATAAACATA TTTTAATTCC AAATTTTCTG 
GCAATTTTCT TTCTACCCTT TACAATGTAA GTCTCCCCCA CTAGACTGGC 
TTCCTATGGA CACAGTATTG TTTCCATAGA TTAAGATGTG TGATCATAAT 
CATCATTTGA TCTTCTCTGA CCTACAAATG TGGCAGTTTC ATATAGTTCA 
ACCTAATAAT TTAGCTTCAT ATTTCTAGAA GAATGTCCAG CTCCAGGTAA 
AAGTTTTTTA AAAATTGCTG TACTGAACTA TTGAATGGAA CTTGGAAATA 
AAGTCCCTTC CAAAATAACT ATTCTTCAAC AGAGAGTAAT AGGTAAATGT 
TTTAGAAGTG AGAGGACTCA AATTGCCAAT GATTTACTCT TTTATTTTTC 
CTCCTAGGTT TCTGGGATAA GTATGTGCAA ATAAAAAATA AACATGAGAA 
GGAACTGTAA CCTGATTATG GATTTGGGAA AAAGATAAAT CAACACACAA 
AGGGAAAAGT AAACTGATTG ACAGCCCTCA GGAATGATGC CCTTTTGCCA 
CAATATAATT AATATTTCCT GTGTGAAAAA CAACTGGTCA AATGATGTCC 
GTGCTTCCCT GTACAGTTTA ATGGTGCTCA TAATTCTGAC CACACTCGTT 
GGCAATCTGA TAGTTATTGT TTCTATATCA CACTTCAAAC AACTTCATAC 
CCCAACAAAT TGGCTCATTC ATTCCATGGC CACTGTGGAC TTTCTTCTGG 
GGTGTCTGGT CATGCCTTAC AGTATGGTGA GATCTGCTGA GCACTGTTGG 
TATTTTGGAG AAGTCTTCTG TAAAATTCAC ACAAGCACCG ACATTATGCT 
GAGCTCAGCC TCCATTTTCC ATTTGTCTTT CATCTCCATT GACCGCTACT 
ATGCTGTGTG TGATCCACTG AGATATAAAG CCAAGATGAA TATCTTGGTT 
ATTTGTGTGA TGATCTTCAT TAGTTGGAGT GTCCCTGCTG TTTTTGCATT 
TGGAATGATC TTTCTGGAGC TAAACTTCAA AGGCGCTGAA GAGATATATT 
ACAAACATGT TCACTGCAGA GGAGGTTGCT CTGTCTTCTT TAGCAAAATA 
TCTGGGGTAC TGACCTTTAT GACTTCTTTT TATATACCTG GATCTATTAT 
GTTATGTGTC TATTACAGAA TATATCTTAT CGCTAAAGAA CAGGCAAGAT 
TAATTAGTGA TGCCAATCAG AAGCTCCAAA TTGGATTGGA AATGAAAAAT 
GGAATTTCAC AAAGCAAAGA AAGGAAAGCT GTGAAGACAT TGGGGATTGT 
GATGGGAGTT TTCCTAATAT GCTGGTGCCC TTTCTTTATC TGTACAGTCA 
TGGACCCTTT TCTTCACTAC ATTATTCCAC CTACTTTGAA TGATGTATTG 
ATTTGGTTTG GCTACTTGAA CTCTACATTT AATCCAATGG TTTATGCATT 
TTTCTATCCT TGGTTTAGAA AAGCACTGAA GATGATGCTG TTTGGTAAAA 
TTTTCCAAAA AGATTCATCC AGGTGTAAAT TATTTTTGGA ATTGAGTTCA 
TAGAATTATT ATATTTTACT GTTTTGCAAA TCGGTTGATG ATCATATTTA 
TGAACACAAC ATAACGAACC ACATGCACCA ACCACATGGA TTTTTTTTTA 
AATCAGTTAC TTGAGTCAAA GTATGTATGG TGAGTTAAAT TATGATGCTT 
ATAGGTAATT TCCTATTTGG GACATAGTAG GTATACGCTT TTCCATTCTT 
ACCACACATA ATGGAACTTT GCAAATCCAG TATTTAAAGG CCTACATTTT 
ATATAACTTT TCCTGCCCTT AGAAGAACTG CCATGAGTTT ACTGTGGTAC 
CTTAGTCAGC TGTTCAGTGG TGGAAACTAT AGGGCTGAAT TTGAGGATGC 
AAATCAGAAT GATTTTGTCA TATACGTATA CATGTCCTGG TCACATATGT 
GTATAAGTCT TTTAGTAAAA TCAACTAAAA GTACACACAT TAAATGTTAC 
CGCTAAATAA TTGTTGACAC ATAATTTATT TGTATTGTTA AT AT AT ATT C 
CAATCCTTAA ATATCTTGAT CTTCACATTT TTAAATTATG TTTTTCTGAT 
TCTCATGCTT CTAAGAAATT TGAAAAATAA AGGAAAATAT AAAAATAAAA 
TATATACAAA CCAAATGAAA TTAAAAAAAT GTTATTCATA GTATACTTCC 
TGGTAAGGAT TATATCATCT AAAATTCTTT ATTTTATATT AATATTTCTC 
TTTTTCAACT TTTATTTTGG AGTCGGGGCA TATGTGCACA TATTAACTAA 
ATATAATGCA TGATGCTGAG GTTTGAGCTA TGAGTAATCC CTTTTTGTGG 
CTGTATAACG TGACTGCACA GATGTACAAA TATATTTTTA ACCATTCCCT 
ATTGTTAATT CCAGTTGTTT CTAACTTTTC TAAAAATAAT ATAAATTAAA 
TGAAATTCCT GCTTTTTACC TGCCACTGAA GTCATGAAAA TGTCTAGAAG 
GATTTTACCA TGTCTTAAGG TCATATCTGG CATGATAGGG TTCAAAACAC 
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3151 AGGCTACCTG GTATTAAACA AATTCACTTT GCTGGGCTCC ACAATACACA 

3201 GAAGGAGAAG CAGTCATCCA TCAAAGTAGC TAAACATGAG GGCCAACAAG 

3251 AAGTATAATC AGACTAGATG TACCATGGCT ATTTAGATGG CAT AT AT AAA 

3301 AATACAAAAG AGGAACAAAT AATGGTTTCA AATAAGATTC TCTAAAGGAA 

3351 GTGGGCAAAC ATTCTAAATT ACAAGCATTC ATCAGCAATT GAGCTGATGA 

34 01 ATACAAACTT CACATGGTCT GTTTCAGATT GAGTATCACC GGGGATTTGG 

34 51 ATTTACCTTC TGCTCATGAA TAAGTTAGGG TAAGGCAAAT GATTTTAAAC 

3501 G 



FEATURES: 

Start: 1137 
Exon: 1137-2153 
Stop: 2151 



Chromosomal map position for allelic variants: 
RH Map: 

11000250647308 # SHGCNAME CHROM# LOD SCORE DIST. (cRs) 
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SHGC-1836 


6 


13. 


,79 


5 


2 


SHGC-12753 


6 


9. 


,94 


19 


3 


SHGC-1877 


6 


9. 


,41 


13 


4 


SHGC-8456 


6 


9. 


,28 


21 


5 


SHGC-12805 


6 


9. 


,22 


21 



SNP: 

Allele 1 Allele 2 Context 

POSITION 

3440 c t Tgagctgatgaatacaaacttcacatggtctgtttcagattgagtatcac [c/t] ggggatttgga 

tttaccttctgctcatgaataagttagggtaaggcaaat 
2525 c t Aactatagggctgaatttgaggatgcaaatcagaatgattttgtcatata [c/t] gtatacatgtc 

ctggtcacatatgtgtataagtcttttagtaaaatcaac 



Allele 1 Allele 2 Protein 

POSITION 



3440 c t Intron 

2525 c t Intron 
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